Hi! I am an AI/ML Research Manager at S-Lab, Nanyang Technological University, supervised by Prof. Ziwei Liu. I received my B.Eng. in Computer Science from NTU in 2024 with First Class Honours (Highest Distinction).

My research focuses on Agent Systems (Agent Memory, Agentic RL, RAG) and Foundation Models (Multimodal LLMs, Benchmarking and Evaluation).

News

  • 2026.04: One paper accepted to ACL 2026 Main Conference (Video-MMMU).
  • 2026.02: One paper accepted to CVPR 2026 (OpenMMReasoner).
  • 2025.06: One paper accepted to Findings of NAACL 2025 (LMMs-Eval).
  • 2025.01: Released Video-MMMU benchmark, featured in OpenAI GPT-5 and Google Gemini 3.0 official releases.
  • 2024.08: Joined NTU S-Lab as AI/ML Research Manager.
  • 2024.06: Graduated from NTU with First Class Honours (Highest Distinction).

Selected Publications

ACL 2026
Video-MMMU

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Kairui Hu, Penghao Wu, Fanyi Pu, Wang Xiao, Xiang Yue, Bo Li, Yuanhan Zhang, Ziwei Liu

[arXiv] [Project] [GitHub]

TL;DR: A video reasoning benchmark that evaluates how well LMMs acquire knowledge from multi-discipline professional videos. Featured in the official OpenAI GPT-5 and Google Gemini 3.0 releases, and adopted by Google DeepMind, OpenAI, Alibaba, ByteDance, and others.

CVPR 2026
OpenMMReasoner

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Kaichen Zhang, Keming Wu, Zuhao Yang, Bo Li, Kairui Hu, Bin Wang, Ziwei Liu, Xingxuan Li, Lidong Bing

[arXiv] [GitHub]

TL;DR: An open and general recipe for pushing the frontiers of multimodal reasoning, achieving strong performance across comprehensive multimodal reasoning benchmarks.

NAACL 2025
LMMs-Eval

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Kaichen Zhang, Bo Li, Peiyuan Zhang, Fanyi Pu, Joshua Adrian Cahyono, Kairui Hu, Shuai Liu, Yuanhan Zhang, Jingkang Yang, Chunyuan Li, Ziwei Liu

[arXiv] [GitHub]

TL;DR: A unified evaluation framework supporting 100+ tasks across text, image, video, and audio for 30+ multimodal models. With 3.8k+ GitHub stars, it is widely adopted across the GenAI community for model development and benchmarking.

Honors and Awards

  • 2024: NTU Information Technology Management Association (ITMA) Gold Medal cum Book Prize
  • 2022, 2023: NTU President Research Scholar (with Merit), URECA Undergraduate Research Programme
  • 2020-2022: Dean’s List (Top 5%), College of Computing and Data Science, NTU
  • 2019-2024: NTU Science and Engineering Undergraduate Scholarship (SM2), Ministry of Education Singapore

Education

  • 2020.08 - 2024.06, B.Eng. in Computer Science, Nanyang Technological University, Singapore. GPA: 4.83/5 (First Class Honours, Highest Distinction).